Contest based on a directed polymer in a random medium 
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We introduce a simple one-parameter game derived from a model describing the properties of a 
directed polymer in a random medium. At his turn, each of the two players picks a move among two 
alternatives in order to maximize his final score, and minimize opponent's return. For a game of 
length 71, we find that the probability distribution of the final score Sn develops a traveling wave form, 
Prob(5'n = m) = f{m — vn), with the wave profile f{z) unusually decaying as a double exponential 
for large positive and negative z. In addition, as the only parameter in the game is varied, we find a 
transition where one player is able to get his maximum theoretical score. By extending this model, 
we suggest that the front velocity v is selected by the nonlinear marginal stability mechanism arising 
in some traveling wave problems for which the profile decays exponentially, and for which standard 
traveling wave theory applies. 

PACS numbers: 02.50.-r, 05.40.-a 
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Extreme value statistics of random variables has been 
long studied by mathematicians n, 0] and physicists 
P, 0, ^. In physics, it naturally arises when studying 
thermodynamical properties of disordered systems [p|, 
and in particular, the distribution of the ground state 
energy § |. 

If the considered random variables Ei, E2, ■■■, Ej^ are 
uncorrelated, the distribution of -Emin = min^ Ei or 
^-max = niaxi Ei becomes universal for large TV, once 
properly scaled [|l|, g] . It takes the form of the Gumbel, 
Frechet or WeibuU distribution depending on the asymp- 
totic properties of the distribution of the Ei^s. However, 
in the case of strongly correlated random variables, there 
are no general results, and it is usually a formidable task 
to access to the distribution of i^min or i^max- 

In |j] , the authors study a simple model of a directed 
polymer on a Cayley tree, inspired from the original work 
of ||], but focussing on the ground state properties {i.e. 
zero temperature) . The simplest version of the model is 
defined on a Cayley tree developed over n generations, 
and with Z branches originating from each node. The 
polymer made of n bonds starts from the root node, and 
for a given path on the tree, the total length of the poly- 
mer (assimilated to its energy) is 



E, 



path 



iGpath 



(1) 



The elementary lengths h are quenched random variables 
associated to each bond of the tree and independently 
drawn from the same random distribution p{l). The hi- 
erarchical structure of the Cayley tree induces strong 
correlations between the different possible energies (or 
lengths) of the polymer. One is then interested in the 
distribution of the minimal energy E'min, J-e. the ground 
state energy. Because of these strong correlations, the 
ground state energy distribution in not expected to fall 
into one of the three universality classes arising in the 
case of independent random variables [0, M, the best 



known of them being the Gumbel distribution [Q, 0| . In 
the special case of the binary distribution 



pSij + {l-p)Si^o, pe[0,l], 



(2) 



P{1) 

the authors of B obtained an unbinding transition when 
p > Pc = 1 — Z~^, where the polymer goes from a fi- 
nite length to an extensive length (-Emin) ^ v{p)n. In 
addition, the distribution of -Emin has a traveling front 
form 



PiE^in,n) ^ fiE„ 



v(j))n), 



(3) 



where f{z) decays exponentially fast for large negative 
argument. This last property and the general theory of 
traveling waves 0, g ||, ^ Q lead to a simple selection 
mechanism for the front velocity viji) (linear marginal 
stability; see hereafter). 

In the present work, we define a game theoretical 
model, directly inspired by this directed polymer model. 
Although our model lacks any thermodynamical refer- 
ence, it is certainly related to other optimization prob- 
lems, where the notions of extreme value statistics and 
traveling front arise P|. 

Two players A and B play an alternating game of du- 
ration n, with player A starting the game. When it is his 
turn to play, a player has a choice of Z possible moves. 
Hence, the map of all possible game histories has the 
structure of a Cayley tree with Z branches originating 
from any node, and of length 2n. The i-th move by 
player A brings him the additional score at, whereas the 
next play by player B add the value hi to the score of 
player A. The score of player B is defined as the oppo- 
site of that of player A. The elementary scores a^ and 
hi are quenched random variables independently drawn 
from the same distribution p. Ultimately, the final score 
of player A is 



E/nath — 



^path — 2^ ' 
iGpath 



(4) 



The goal of player A is to maximize its final score, 
whereas player B will do his best to select his plays in 
order to minimize the score of player A, and hence max- 
imizing his own score. The two players have an a priori 
knowledge of the game tree structure so that the final 
score of player A is defined as 



Or) 



max min E, 

available choices of A available choices of B 



path • 



(5) 



From now, we specialize to the case Z = 2, although 
our results can be easily extended to any Z. Moreover, 
we restrain ourselves to the elementary score distribution 
given by Eq. (0). It should be emphasized that the play- 
ers do not pick their next play in order just to maximize 
its local outcome {i.e. A picking its next move among 
available branches with Oi = 1 or _B picking the min- 
imum available bi). If the players were adopting such 
a simple depth-0 strategy, which would be their natural 
approach if they did not have the prior knowledge of the 
Ci's and &i's distribution over the tree, the final score 
of player A would be simply the sum of n independent 
variables of mean p^ + 2p{l — p) {A picks a branch with 
Oi = 1, if there is one available), and n variables of mean 
p^ (B picks a branch with bi = 0, if there is one avail- 
able) . Then the distribution of 5'„ would be a Gaussian 
(of width a ~ \/n), and mean {Sn) = vo{p)n, with 



vo(p)=p^ + 2p{l-p)+p^ 



2p. 



(6) 



Note that this result is identical to the score velocity 
obtained if the players had picked their move at random: 
the depth-0 strategies of both players exactly annihilate. 
Instead, having a global view of the game tree, the players 
will try to direct the game into favorable branches for 
them, in order to maximize their final score, even if they 
may have sometimes to pick an unfavorable local move 
(oi = for player A, bi = 1 for player B) in order to 
achieve their goal. For p > 1/2, there are more bonds 
with a,; = 1 or 6i = 1, so that we expect that the objective 
of player A should be easier to achieve than that of player 
B. Hence, we anticipate that (5„) = v{p)n, with 



v{p) > vo{p) = 2p, 



1 
P^2- 



(7) 



In the opposite case p < 1/2, the above inequality is 
obviously reversed. In fact, by exchanging the roles of A 
and B (and neglecting the fact that A starts the game, 
for large n) , it is clear that one has the symmetry relation 



vip)+vil-p) = 2. (8) 

In addition, one has the trivial constraints, 

i;(0) = 0, «(1/2) = 1, «(1) = 2, (9) 

which are consistent with Eq. (0). 



An intermediate strategy to the ones presented above 
corresponds to players having only a partial view of the 
game tree up to a finite depth. For instance, if the players 
have the knowledge of there next available move, and of 
their opponent's ensuing options, they should adopt the 
following depth-1 strategy: 

• Flayer A: if the options of player A are equal (both 
ai = or 1), A picks the branch for which the num- 
ber of bi equal to 1 (when it will be the turn of B to 
play) is maximal. If only one branch corresponds 
to ai ~ 1, A choscs this move. 

• Player B: if the options of player B are equal (both 
bi = or 1), B picks the branch for which the 
number of a^+i equal to 1 is minimal. If only one 
branch corresponds to bi = 0, B picks this move. 

After a elementary but cumbersome calculation, wc find 
that the score velocity vi {p) corresponding to this depth- 
1 strategy is given by. 



vi (p) = 2p 



2 7 - 6p + 4p2 _ I4p3 _^ i^pi _ 4p5 



l + 2p + 6p^ - 16p3 



(10) 



One has wi(p) > voijj) for p > 1/2, and ui(p) satisfies 
the symmetry relation of Eq. (ra) and the conditions of 
Eq. (p). For higher but finite strategy depth, an analyt- 




FIG. 1: (Color online) Sn+i can be obtained recursively from 
four optimized scores 5„ (fc = 1, 2, 3, 4), and finding the next 
optimized move from player B and then from player A. 

ical treatment becomes extremely complicated. 

Let us now move back to our model, where both players 
have a global knowledge of the game tree (infinite depth) . 
Obtaining the (not necessarily unique) optimal path re- 
alizing both players antagonist goals can be achieved by 
using the recursive minimax algorithm p2[ , which gives a 
more precise meaning to Eq. dq). Let us assume that we 
have generated four instances of optimized scores Sn 
(k = 1,2,3,4) on four independent games of length n 

(with the initial condition Sn = 0, for n = 0). In or- 
der to construct an optimized score for a game of length 
n + 1, we first generate two intermediate scores including 



the previous move of player B (see Fig. |l|), 

Rl^^ = mill (^SlP + 61, 5(2) + 62^ ^ 

i?^^) = min (^(3) + 63, 5(4) + 64) . (11) 

Then the final score is obtained by optimizing the first 
move of player A over his two possible plays, ai and 02 
(see Fig. I): 



Sn+i = max ( i?(i) + ai,Rl^'> 



a2 



(12) 



Using Eqs. (|11|,^, we can derive the corresponding re- 
cursion relations for the cumulative distribution of 5'„ 
and Rn, 



Pn{m) = Prob(S'„ <m), 
Qn{m) = Prob (i?„<m), 



(13) 



and starting from the initial condition Pn{m) ~ 1 for 
TO > 0, and P„{m) = for m < 0. Defining q = 1 — p, 
we find 

Qn{m) = 1 - {1 - qPn{m) - pP^{m - l)f (14) 



P„+i(m) = {qQ„{m) + pQn{m - I)) 



(15) 



The intermediate distribution Qnirn) can be eliminated 
by inserting Eq. ( p^ into Eq. (p!5|), leading to a single 
recursion relation between Pn+i and P„. The probability 
density of Sn is defined as 

p„(m) = P„(to) - P„(m - 1). (16) 

We look for a traveling wave form for P,i(m) 

Pr,{m) = F{m - (5„)), (5„) = w(p)n, (17) 

with the boundary conditions F{z) -^ 1, for z — > +00, 
and F(z) — > 0, for z —f —00. The probability density of 
Sn has a similar traveling wave form, associated to the 
hull function f{z): 

Pn{m)^f{m-{Sn)), f{z)^F{z)~F{z-l). (18) 

Inserting this ansatz into Eqs. dl4,Qq), we find that F 
satisfies the following functional equation 



VP{z - v) = l-q[l-qF{z)-pF{z-l)f 

- p [I - qF (z - I) -pF{z-2)]\ (19) 

where we have used the shorthand notation v for v{p). 
By retaining the leading contributions in Eq. ( [l9| ) for 
z -^■ — c», and for u > 0, we find 

F{z-v) ^iq'^F^iz), (20) 

which leads to the double exponential asymptotics 

i^(^)~/(^)^^exp(-«_2^), (21) 



where a_ > is an unknown p-dependent constant. Sim- 
ilarly, in the opposite limit 2 —f -l-oo, and assuming w < 2, 
the functional equation Eq. (noh reduces to 



1 - F(z - v) - 2p3(l _ F{z - 2))2, 
which again leads to a double exponential decay 



1-F(z- 1) 



/(^) 



1 
2^ 



exp (— q:_|_2 2- 



(22) 



(23) 



where a+ > is some p-dependent constant. Hence, and 
contrary to the standard traveling wave theory |^, ^, ||, 
where the traveling front exponential decay for z -^ —00 
or z —f -|-oo permits the determination of the front veloc- 
ity, v{p) remains so far undetermined. Here, the double 
exponential decay obtained on both sides results from the 
minimax constraint, instead of the usual min (or 77102;) 
constraint imposed when considering the ground state 
energy or the minimum (or maximum) path length dis- 
tribution 1^, ||. This fast decay of f{z) for z -^ ±00 
and the traveling wave form of Eq. (nS) ensure that 
{{Sn — v{p)nY\ remains bounded when n -^ +00. 

v{p) can still be determined numerically, from its def- 
inition {Sn) = v{p)n. The results are shown on Fig. 0, 
along with vo{p) and vi{p) which correspond to depth-0 
and depth- 1 strategies respectively. The main feature of 
v{p) is the existence of a critical value of p (denoted Pc), 
above which the score front velocity is v{p) — 2 (note 
that one also has v[{l) = 0). Moreover, and as men- 
tioned above, vq{p) is a lower bound of v{p). Finally, for 
p close to 1/2, v{p) grows linearly with p, with 



«o(l/2) 
^'(1/2) 

v[{l/2) 



= 2, 



2.123(1), 
37 



16 



= 2.3125. 



(24) 



In the symmetric case p ~ 1/2, we find that (5„) — 
n + tToi where cto = 0.14291695... is a strictly positive 
constant, illustrating the slight advantage that A gains 
from starting the game. 

Let us give a physical explanation for the occurrence 
of this transition. As p > 1/2 increases, the number of 
paths along which all the a^ 's and hi 's are equal to 1 grows 
exponentially. Indeed, the probability of having such a 
path is p^", so that their total number in the tree is of or- 
der 2^"xp^''. The existence of this transition shows that 
for p > pc, player A is able to chose his moves in order 
to force the outcome of the game to follow one of these 
path, with probability unity, as n — *■ 00. Symmetrically, 
for p < 1 — Pc , player B will find enough branches along 
which most a^'s and 6i's are equal to 0, in order to en- 
force that the front velocity remains zero in this regime, 
consistently with the symmetry relation of Eq. (||). 

This transition can be understood analytically, by 
studying the stability of a traveling front of velocity 






2 


- 




^^ 


1 


1 ' 1 ' 








































1.8 


- 






/ 




1.6 


— 








'^ _ 










/ /v^ 




1.4 


- 






^y<y 
z/> 


- 








^ 


^ 




1.2 
1 


z 


/ 


'■iT 




, 1 , 



FIG. 2: (Color online) We plot the score velocity front t)(p) 
(full line), the lower bound «o(p) = 2p (lower dashed line), 
and «i(p), obtained when the players adopt a depth- 1 strat- 
egy (upper dashed line). For p < 1/2, ii(p) is obtained by 
symmetry, using Eq. (^j. The heuristic expression of Eq. ( |3C| ) 
cannot be distinguished from the numerical data at this scale. 



v{v) = 2. Noting that Pn{2n) = Qn{2n + 1) = 1 (since 
Sn < 2n and i?„ < 2n + 1), we consider the dynamics of 
Un = Pn{2n — 1) < 1. From Eqs. ( [l^ , p^ , u„ satisfies the 
recursion relation 



Un+l 



[l-p^l-Unf 



(25) 



with uq = 0. Un = Pn{'2,n — 1) = 1 is an obvious fixed 
point of Eq. (gSJ), corresponding to the case w(p) < 2. If 
this fixed point is selected, and after linearizing Eq. (25), 
we find that 



ln(l-P„(2n-l))~-2'^ 



(26) 



which is fully consistent with Eq. (E3h, with z = 2n — 
1 — v{p)n. However, the recursion relation of Eq. (E3) 
has three other fixed points (x-,x-^.,XQ), which can be 
obtained analytically by solving a third order polynomial 
equation, leading to cumbersome expressions. A detailed 
analysis shows that xq is always real, with xq > 1. This 
fixed point is unphysical and necessarily unstable. The 
two other fixed points are real for p > Pc (2;- < 2:+), 
with 



Pc 



. 2^/3 == 0.94494. 



(27) 



and complex conjugates for p < pc- Moreover, one finds 
that X- is maximal for p = pc, at which x± — 1/9. 
Finally, a stability analysis shows that x- is the only 
stable fixed point for pc < P < 1- Hence, we conclude 
that Pn(2n — 1) converges (exponentially fast) to X- for 
Pc < P < 1, and that the distribution of Sn is peaked 
near m ^ 2n and decays as a double exponential for 
m <C 2n (as given by Eq. (|2l|)), leading to v{p) = 2. 
The obtained value of pc is in perfect agreement with 
the numerical results for v{p) plotted on Fig. 0. Close to 



p = 1, X- ^ 9(1 — p)^ -^ 0, and up to second order in 
(1 ~ p), the distribution of Sn is thus given by 

p„(2n) = l-9(l-p)2, pni2n - 1) = 9{l - p)\ (28) 

Note that if both players adopt a finite depth strategy, 
this transition does not occur, as illustrated in Fig. g 
in the case of depth-0 and depth- 1 strategies considered 
above. By adopting a short-sighted strategy, player A 
(respectively B) cannot direct, with probability 1, the 
sequence of plays to a branch of the tree with a density 
unity of playing options a; = 6^ = 1 (respectively a^ = 

h = 0). 

Finally, for p below but close to pc, we obtain a very 
convincing fit of v{p) to the functional form 



v{p)^2-c{p,-py'' + 



(29) 



with c w 0.50(1), leading to an infinite slope for v(j)) 
a.t p ~ p^ , as found numerically on Fig. 0. In fact, for 
p > 1/2, we find that the simple heuristic functional form 



v{p) = 2-2 



{pc -p)(l -p) 
2pc-l 



1/2 



(30) 



fits the data with a relative accuracy better that 0.1%, 
comparable although slightly higher than the estimated 
numerical error bars of the data. This functional form 
ensures that t'(l/2) = 1 and that the behavior of Eq. (p9) 
is reproduced, and leads to the heuristic values. 



v'{0) ^ 2.12099. 



c = 0.49748... 



(31) 



in good agreement with the numerical estimates pre- 
sented above. 

Let us now address the properties of the hull function 
/(z), and its cumulative sum F{z). First of all, if for a 
given p, the corresponding v{p) happens to be a rational 
number v(j)) = a/f3 {a and (3 being mutually prime), 
Eq. (|lj) implies that the hull function is only defined 
on the discrete set of fractions of the form k//3. This is 
in particular the case for p = 1/2 (f (1/2) = 1), p > pc 
{v{p) = 2), and p < 1 — Pc {v{p) = 0). On the other 
hand, when v(jj) is irrational, the set of points of the 
form z — rn ~ v(j))n is dense on the real axis, and f{z) 
is a continuous function defined on the real axis. As 
v{p) approaches v{pc) = 2 from below, the hull function 
/(z) develops steps which blend into discontinuities as 
p —^ Pc- This property is illustrated on Fig. ^, along with 
the asymptotics obtained in Eqs. ( ^ , p3| ). 

We now extend our original model in order to gain 
some insight about the velocity selection mechanism. 
This is achieved by modifying the model so that the 
standard theory of front propagation will apply. In this 
[A, £)-model, player A always follows its best strategy, 
while player B follows the depth-0 strategy (picking a 
branch with 6^ = if available) with probability e and 




FIG. 3: (Color online) We plot the hull function f{z) for 
p — 0.94495 > Pc, for which v{p) — 2, and which is defined 
on negative integer values of z (dots linked by a dotted line) . 
For p = 0.9449 slightly below pc {v{p) = 1.99684...), the hull 
function is continuous but exhibits smooth steps at integer 
values of z (corresponding thin line). Finally, for p — 0.75 
{v{p) — 1.53146...), we plot the continuous hull function (thick 
line) along with the predicted asymptotics of Eqs. (E11g3) 
(dashed lines). 



its optimal strategy with probability 1 — e. The origi- 
nal model is hence recovered for e = 0. The recursion 
relation of Eq. (|lj) now becomes 

Qn{m) = (1 - e) (l - (1 - qPn{m) - pP„{m - 1))') 
+e ((g2 + 2pq) P„{m) +p'Pn{m - 1)) , (32) 



while Eq. (15) remains unchanged. We denote the associ- 
ated front velocity by VAiv, ^)- Note that if B were play- 
ing purely randomly, the terms q'^ + 2pq and p^ in Eq. ( [3^ ) 
must be replaced respectively by q and p. This model 
has exactly the same qualitative properties as the {A, e)- 
model, on which we hence concentrate, since player B 
adopts a more intelligent strategy (some results obtained 
for the random model will be mentioned in passing). In 
the {B, e)-model, one exchanges the role of players A and 
B, and Eq. (15) is changed into 



Pn+i{m) = (1 - e) {qQn{rn) + pQn{m - 1)) 

+e {q^Qn{m) + {p" + 2pq) Q^{m - 1)) , (33) 

whereas Eq. (|4|) still holds. When exchanging the role 
of the two players, the associated front velocity vb{p,£) 
satisfies the symmetry relation. 



VAip,e) +vb{1 -p,e) 



(34) 



which reduces to Eq. (H), when e = 0. However, for 
£ > 0, va{p, e) and ws(l — p, e) do not obey the symme- 
try relation of Eq. (ph. In addition, we have the exact 
inequalities. 



since players B and A are respectively playing less opti- 
mally in the {A, e) and {B, e)-models than in the original 
model. 

Because of the symmetry relation of Eq. (|3J), we focus 
on the {A, e)-model, and denote the associated velocity 
simply by v. The {A, e)-inode\ main interest lies in the 
fact that for e > 0, P„(to) = 1 — P„(m) decays expo- 
nentially for m 3> wn, so that the standard mechanisms 
of front velocity selection do apply (see below). When 
P„(to) <C 1, the recursions of Eqs. (p^j32|) indeed lead to 



Pn+iim) = 2e [il+p)il-pfPn{m) 
+p{l - p){2p + l)P„(m - l)+p^Pn{m - 2)] , (36) 

or equivalently, the front profile F{z) = I — F{z) satisfies 

F{z - v) = 2e [{l+p){l - pfF{z) 
+p{l-p){2p+l)F{z-l)+p^F{z-v)\ , (37) 



to be compared to Eq. (E2f), for the original model. The 
simple ansatz F(z) ~ exp(— Az) leads to the dispersion 
relation 



v{\) = ^\n[2e{l-p 



pc 



[l-p' 



-pV)], (38) 



where the decay rate A is so far undetermined. 

Let us now summarize the main known mechanisms of 
velocity front selection |^, ||, ||, |l^, |l^ for exponentially 
fast decaying profiles. In many physical cases, including 
those studied in 0,0], a linear marginal stability (LMS) 
argument shows that the selected front velocity corre- 
sponds to the minimum velocity Wmin allowed by the dis- 
persion relation w(A), associated to the decay rate Amin- 
However, in some other cases 0, ITol, O, a bigger veloc- 
ity is selected by a nonlinear marginal stability (NLMS) 
mechanism. Without entering into too much details, let 
us briefly explicit this point. Consider the large z asymp- 
totics of a solution of the full nonlinear problem associ- 
ated to the velocity w, 



F{z) - Ai{v) e-^i(^)^ -f A2{v) e'^^^")^ -h 



(39) 



VB{p,e) < v{p) < VA{p,e), 



(35) 



with Xi(v) < Amin given by the dispersion relation de- 
rived from linear analysis. Note that the above linear 
analysis does not grant access to Ai{v), not to men- 
tion the correction term proportional to A2{v). Now, 
if there exists a velocity w, > Wmin for which Al{v.^,) = 0, 
F{z) will decay more sharply with rate A2(w*), which is 
necessarily another root of the dispersion relation, with 
X2{v*) > Amin- It Can be shown that all traveling fronts 
with velocity less than v^, are then unstable against inva- 
sion by a profile of velocity v^, , which leads to the selec- 
tion of the velocity front d* , instead of v^i^ 0. |lO|, O . In 
practice, there arc very few examples for which the tran- 
sition between a linear and a nonlinear marginal stability 
scenario can be analytically identified, since it requires in 



general a full solution of the profile associated to a ve- 
locity V, in order to obtain Ai[v). To the knowledge of 
the author, all such tractable examples concern traveling 
front in the spatial and temporal continuum [pi nO, ni|, 
like for instance, 



dP 

'dt 






P(1-P)(1 + kF), 



(40) 



for K > 1. In this case y, |l^, v[X) = A + A ^, so that 

I'min = 2 and Amin = 1- f^min IS SClcctcd for 1 < K < 

2, whereas w = w, = {K./^f/'^ + (k/2)-i/2 (with Ai = 
(k/2)-i/2 and A2 = (k/2)1/2), for n>2. 

Returning to our {A, £)-niodel, we find that a non triv- 
ial Wmin exists for any e > 1/2. It is obtained by first 
finding Amin, the unique real positive solution of 



^'(A,, 



= 0, 



and setting w,) 
find that Wniin 



= w(Amin) in Eq. (| 
2, for p > pc, with 

p, = (2£)-i/3. 



(41) 
In particular, we 

(42) 



In the case e = 1, when player B always adopts the 
depth-0 strategy, we find Pc = 2-^''^ = 0.7937005... 
Hence, we obtain the same kind of transition as in the 
original model, where player A is able to get its maximum 
theoretical score. However, since B has a short-sighted 
strategy, we do not observe a transition to w = for 
small but non zero p, as obtained in the original model 
for p = \ — Pc- We actually find Umin ^ — ^K V , when 
p ^ 0. Note that if B plays randomly instead of adopting 
the depth-0 strategy, wc obtain the dispersion relation 



v{X) = jln 



2£ (1 - p + pc^ 



(43) 



andpc = (2e)^i/2. 

However, for a given p, we find numerically that the 
velocity given by the LMS mechanism Wmin is only se- 
lected for £ > £c(p), so that the results of Eqs. (|8|,|l]J42|) 



are only valid for e close enough to 1. For 1/2 < £ < 
£c(p) < 1, and although a non trivial iimin does exist, 
we find V > Wmin- This strongly suggests the relevance 
of the NLMS mechanism in this case. Unfortunately, for 
1/2 < £ < £c(p) and a given v, there is very little hope 
to obtain an analytical solution of the corresponding full 
nonlinear equation for F{z), in order to apply the NLMS 
criterion explained above. Likewise, for e < 1/2, the 
minimal positive velocity is Vmin = {v{X = 0) = —00), 
and the prospect of an analytical solution appears even 
bleaker. Note however that v and the associated de- 
cay rate A are still related by the dispersion relation of 
Eq. (§). 

On the bright side, the full line Pc{s) can be determined 
exactly, by studying the dynamics of u„ = 1 — P„(2n— 1), 
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FIG. 4: (Color online) We plot the exact Pc{£) above which 
va{p,£) = 2 (full line), whose LMS and NLMS expressions 
are respectively given by Eq. (E2) and Eq. (M. We also 
plot the numerical estimate of £c(p) (thick dashed line), the 
boundary between the LMS and NLMS domains of applica- 
tion. These two curves cross at (scPc) ~ ( t, ^-2— I (dot). We 
also identify three domains according to the relation between 
the observed front velocity v and the LMS velocity Umin- 



in the same spirit as in the case £ = 0. Wc find that u^ 
satisfies the exact recursion relation 



Un+l = gp,eiUn) = Zn(2- Zn), 
Zn = P^Unie+ (l-£)u„). 



(44) 
(45) 



with uo = 1. Wc then determine the value oipc{e) above 
which there exists a stable non trivial fixed point u^{e) 7^ 
0. For £ > 4/5, we find that Pc{£) is indeed given by the 
LMS argument, leading to the result of Eq. (B3). On 
the other hand, for < £ < 4/5, a regime were the 
NLMS mechanism should be relevant, pd^) and u*(£) 
are determined in the same spirit as in the case £ = 0, by 
imposing that gp^,e{u*)/ut — 1 = g'p^,,{u^,) — 1 = 0. We 
find 



Pc(e) 



(l„£)l/2(4„e)3/2_8 + 7£ + . 



2£2 



1/3 



(46) 



which goes smoothly to the result of Eq. ( |27| ) , when e -^ 
0. 

Interestingly, this analysis provides the exact value of 
£c for the corresponding value of p = Pci^c) ~ (2£c)^^^'^, 
at the transition between the LMS and NLMS regimes. 
We thus find 



P 



5IA3 
2 



0.854988. 



(47) 



If B plays randomly instead of adopting the dcpth-0 
strategy, one obtains 



1 



= 0.809017... = 3 - VS = 0.763932. 



(48) 




FIG. 5: (Color online) We plot the velocities va{p,b) (three 
upper dashed lines; e = 1, 4/5, 1/4 from top) and VB{p,e) 
(three lower dashed lines; e = 1, 4/5, 1/4 from bottom). The 
full line corresponds to v[p) for the original model (e = 0). 



model, this frustration originates from the minimax con- 
straint, which is, however, quite uncommon in physics. 
As a consequence, the present model has no thcrmo- 
dynamical interpretation. We found that the score dis- 
tribution develops a traveling wave form, with the hull 
function unusually decaying superexponentially for large 
negative and positive arguments. We have justified an- 
alytically the occurrence of a transition, across which a 
player can obtain his maximum theoretical score, what- 
ever the strategy of the other player. Contrary to systems 
for which the standard traveling wave theory applies, we 
did not succeed in understanding analytically the process 
which leads to the selection of the velocity front. How- 
ever, after studying an extension of the original model, we 
strongly suggest that the selection mechanism is related 
to nonlinear marginal stability, arising in some traveling 
wave problems for which the profile decays exponentially. 



In Fig ^, we plot our exact result for Pc{e) and a nu- 
merical estimate oi edp), the boundary between the LMS 
and NLMS domains of application. In Fig |^, we plot 
va{p,£) and vb{p,s) for e = 1, where B and A are re- 
spectively adopting the depth-0 strategy, for e = 4/5, 
the smallest e for which the LMS mechanism applies for 
all p, and for e = 1/4, for which the NLMS mechanism 
holds for all p. We observe numerically that va (p, e) and 
vb(Pt£) converge smoothly to v(jp) as e — > 0. 

Let us finally address the subleading corrections to the 
average score (S'n). The LMS mechanism implies W pTJ ] 
that 



{S„ 



2An 



Inn 



(49) 



In the {e,p) regime where the LMS mechanism applies, 
we have confirmed numerically the occurrence of this log- 
arithmic correction, as well as its magnitude. In the 
NLMS regime, we find instead that the next correction 
to {Sn) = vn is constant. Quite generally, this result can 
be justified analytically whenever v > Wmin, in particular 
when the NLMS mechanism applies [|l3|. This property 
was exploited in order to obtain the numerical estimate 
of Sc{p) shown in Fig U, and this criterion is found to be 
fully consistent with defining edp) as the value of e for 
which V becomes equal to the velocity Umin selected by 
the LMS mechanism (see Eqs. (|38|^,||)). 

In the present work, we have defined a simple two- 
player game inspired by a model of directed polymer on 
the Cayley tree. The fact that the two players have an- 
tagonist goals is reminiscent of the notion of frustration 
quite conunon in disordered physical systems. In our 
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